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Abstract 

We present a fast multiscale approach for the network minimum logarithmic arrange- 
ment problem. This type of arrangement plays an important role in a network compression 
and fast node/link access operations. The algorithm is of linear complexity and exhibits 
good scalability which makes it practical and attractive for using on large-scale instances. 
Its effectiveness is demonstrated on a large set of real-life networks. These networks with 
corresponding best-known minimization results are suggested as an open benchmark for a 
research community to evaluate new methods for this problem. 
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Finding a suitable compressed representation of large-scale networks is intensively studied in 
both practical and theoretical branches of data mining [121 12 [HI [26]. In particular, the success 
of applying some of the recently proposed compression schemes [HI G3, [3] strongly depends on the 
"compression-friendly" arrangement of network nodes. Usually, the goal of these arrangements 
is to order the nodes such that the endpoints of network links (edges) are located as close 
as possible. Doing so leads to a more compact representation of links and allows a better 
J> performance of compression schemes and network element access operations. 

In pi], Chierichetti et al. propose a combinatorial optimization problem, namely, the min- 
imum logarithmic arrangement problem, that seeks a nearly optimal information-theoretical 
compressed encoding size for all network links. This is achieved by ordering the network nodes 
and assigning to them unique integer values (ids) such that the endpoints of a link will obtain 
close values. The problem has been proven to be NP-hard, and two heuristics for ordering the 
social networks were given. In this paper, we present a multiscale method for approximating 
a generalized link-weighted and node-volumed version of the minimum logarithmic arrangement 
. for general networks. The importance of a link- weighted property (when each link is assigned 

by a nonnegative weight) for this problem is twofold. First, the link weight can measure the 
significance of that link. For example, in many cases we have information regarding its access 
frequency, and we would prefer that frequently accessed links would be compressed better. Sec- 
ond, the multiscale algorithmic framework admits a natural aggregation of weighted links at 
different scales of the problem representation (discussed later). Similary to the link weights, all 
nodes are assigned by nonnegative volumes that represent the length of their segments captured 
in the ordering of the network nodes. 

Our multiscale algorithm is based on the algebraic multigrid (AMG) [8] methodology for 
linear ordering problems 123] . The main objective of the AMG-based framework is to con- 
struct a hierarchy of problems (coarsening), each approximating the original problem, but with 
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fewer degrees of freedom. This is achieved by introducing a sequence of successive projec- 
tions of networks' graph Laplacians into lower-dimensional spaces and solving the problem in 
them. The multiscale framework has two key advantages that make it attractive for applying 
on modern large-scale instances: it exhibits a linear complexity, and it can be relatively easily 
parallelized and implemented by using standard matrix- vector operations. Another advantage 
of the multiscale framework is its heterogeneity, expressed in the ability to incorporate external 
appropriate optimization algorithms (as a refinement) in the framework at different scales. For 
more detailed surveys on the multiscale methods and their parallelization for combinatorial 
optimization problems, we refer the reader to [9] \27\ 17]. 

In contrast to [11] , the main goal of this work is to provide a generic solver for the minimum 
logarithmic arrangement for general networks. Social networks (including big parts of Web 
graph) commonly consist of a combination of structural properties (such as degree distribution, 
small diameter and expander-like topology) that enable fast greedy methods (usually based 
on some preferential ordering of graph traversal) to find arrangements for further high quality 
compression. These methods can be successful because most of the links exhibit good locality 
|X8|, [28] and only a small number of them can be considered as global edges that connect well- 
separated nodes/regions of a network. Thus, these networks can hardly be considered as strongly 
irregular and difficult instances. In real life, however, there occur many situations when the 
structure of a stored network (or a part of it) is complex and irregular (such as decision/detailed 
supply /infection spread networks and even parts of social networks that do not exhibit power 
law degree distribution), which creates many contradictions between greedy local decisions and 
solutions that consider a more global picture. We address this type of problem by introducing 
a multiscale solver. 

The experimental part of this work shows how far are the existing state-of-the-art ordering 
heuristics from being optimal. We demonstrate significant improvement for minimization of 
logarithmic arrangement for various families of networks (including social networks and other 
(ir)regular instances). The comparison was performed with several recently introduced methods 
in [U El EMU]. In almost all cases, our solver exhibited better numerical results than previous 
best-known results. 

We introduce notation and necessary definitions in Section [2] The main algorithm is de- 
scribed in Section [3j The computational results and discussion about different parameter set- 
tings are presented in Section [4] In Section Section [5] we conclude and provide possible future 
research directions. For readers interested in their own implementation, we note that some 
details regarding the multiscale scheme (that are not related strongly to the discussed problem) 
are omitted in this paper; they can be found in [22l [231 Ell E]. 

2 Notation and problem definition 

A network is described by a weighted directed graph G = (V, E), where V = {1, 2, n} is the 
set of nodes (vertices) and E is the set of directed edges. If ij G E, then there exists an edge 
Denote by w%j the non- negative weight of the directed edge ij between nodes i and j; if 
ij ^ E, then w%j = 0. Let ir be a bijection 



The purpose of the link-weighted version of the minimum logarithmic arrangement problem 
(MLogA) is to minimize 
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over all possible permutations tt. The base of a logarithm (lg) will be always 2. We define the 
generalized form of this problem (GMLogA) that emerges during the multilevel solver; each 
vertex i is assigned with a volume (or length), denoted Uj. Given the vector of all volumes, v, 
the task now is to minimize the cost 

c(g, x) = } y Wjj lg \xj - Xj\ (2) 

ij&E 

over all possible it, where Xj = ^ + 7r /fe)<7r(i) * na ^ * s ' eacn vertex is positioned at its center 
of mass, capturing a segment on the real axis that equals its length. The original form of the 
problem is the special case where all the volumes are equal. 

We denote the process of creating a weighted undirected graph Q = (V,£) by Q <— un(G), 
where G is a given directed graph. By applying un(G), we obtain V <— V and 

ij G £ 44> ij £ E 01 ji £ E . 

The edge weights are accumulated from the set of directed edges 

V ij £ £ Wij = ^2 w ij + w 3 i 

ijeE jieE 

allowing use of multigraphs. Both MLogA and GMLogA are formulated for the undirected 
graphs similarly to their directed versions. In this paper we will find approximate solutions for 
the following problem: 

Problem 2.1 Given a directed weighted network graph G, the objective is to find a permutation 
7r that minimizes c(G,x n ), where x n is a vector of the respective node coordinates restricted by 

IT. 

Note that the problem is formulated for both directed and undirected graphs. We define^ Pcx^ 
as the main unit for the empirical comparison in Section |4j 

PG,x„ = C(G,X W )/ ^ W V ■ ( 3 ) 

ijeE 



3 The algorithm 

In the multiscale framework we construct a hierarchy of decreasing-size graphs, Gq,G±, Gk, 
by coarsening, starting from the given graph Go <— un(G). In other words, we create an 
undirected version of a given directed graph by assigning to the undirected edges bigger weights 
if, in the original unweighted graph, the directed edges had been reciprocal. At the coarsest 



level Problem 2.1 is solved exactly, and starting from the (k — l)th level it is formulated and 
approximated by successive prolongation of the solution from the previous coarser level. This 
entire process is called a V -cycle (see Figure [T]). 



3.1 Coarsening 

In the present work, coarsening is interpreted as a modified process of weighted aggregation 
reinforced by the algebraic distance couplings for logarithmic sum minimization. For a detailed 
discussion about the algebraic distance-based weighted aggregation and multiscale graph orga- 
nization related to the general graph optimization problems, we refer the reader to [20J. We 
will briefly repeat its basic components for the completeness of the paper and will concentrate 
on the modifications related to Problem 12. 11 

1 j3 with no subindexes will be used where appropriate. 
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Exact solution 

Figure 1: Scheme of V-cycle. The arrows show the order in which smaller graphs are created 
and revised for approximation. 

3.1.1 Algebraic distance coupling 

Algebraic distance-based coupling is a measure of connectivity strength between two nodes 
connected by an edge [20l El]. Given the Laplacian of a graph, denoted by L = D — W, 
where TV is a weighted adjacency matrix of a graph and D is the diagonal matrix with entries 
da = Wij, we define an iteration matrix H for Jacobi over-relaxation (JOR) as 

H = {D/oj)-\{1/uj - 1)D + W z + W u ) , 

where < oo < 1 (see Appendix A) and Wi and W u are the strict lower and upper triangular 
parts of W, respectively. 

Definition 3.1 The algebraic distance coupling pij is defined as 

^• = i/(Eigixf' r) -4 M i) , 

r=l 

where x^ k ' r ^ = H k x^°' r ^ is a relaxed randomly initialized test vector (i.e., x^ '^ * s a random 
vector sampled over [-1/2, 1/2]), R is a number of initial test vectors, and k is a number of 
iterations. 

Note that this definition is a modified version of the original definition from [20^ [TO] which 



makes it more suitable for Problem 2.1 Several interesting properties (including convergence 



and model description) still can be proved similarly to [10] . 



3.1.2 Coarse graph construction 

Considering p^j as an edge strength measure, one can construct a coarse graph by defining a 
classical AMG interpolation matrix P that will project the fine graph to the coarse graph (i.e., 
to the lower-dimensional space). The projection is represented as 

L c ^PL f P T , 

where Lj and L c are the Laplacians of fine (Gf) and coarse (G c ) graphs, respectively. 

We begin by selecting a set of seed nodes C C Vf that will represent the centers of future 
coarse nodes (or aggregates). In fact, C is interpreted as a dominating set of Vf (not necessarily 
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the minimum size) such that other (fine) nodes F = Vj \ C should be strongly coupled to C. 
This can be done by traversing all nodes and identifying for every visited node i whether 



y~]pij/ ^ Pij > ©l and ^ Wjj/ ^ ?%■ > <d 2 , (4) 

jec jeVf jec jev f 

where ©(1,2) are the parameters of coupling strength (see Appendix A). The order in which 
the nodes are traversed is based on the future volume principle |20| . which is a measure of how 
large (representative in terms of the current minimization problem) a coarse node can be. To 
keep the linear complexity of the entire framework, the order need not be calculated exactly 
but only roughly (for example using bucketing sort). In contrast to the multiscale approach 
for the generalized minimum p-sum problem (20], we found that automatically moving to C 
those nodes that have exceptionally large future volume is not necessary and even has a small 
negative impact in several social networks. 

After identifying C, we define for each i £ F its coarse neighborhood Nf that contains a 
limited set of C-nodes to which i is connected. The criterion for choosing C-nodes to Nf is also 
based on p^. Let be the ordinal number in the coarse graph of the node that represents 
the aggregate around a seed whose ordinal number at the fine level is j. The classical AMG 
interpolation matrix P is defined by 



P H(j) 



( w ij/ E w ik for % € F, j E Nf 

keNf 

1 for % e C, j = i ( 5 ) 

otherwise . 



Pu(j) thus represents the likelihood of i belonging to the I(j)th aggregate. The edge connecting 
two coarse aggregates p and q is assigned with the weight w pq = Y2k^i PkpWkiPiq- The volume 
of the pth coarse aggregate is Ylj v jPjv 

3.2 Uncoarsening 



The uncoarsening process starts by solving Problem 2.1 at the coarsest level. Since the number 
of nodes at the coarsest level is very small (in our tests it was 9), the problem can be solved 
exactly by exhaustive search. 

3.2.1 Minimizing the contribution of one node 

Before proceeding to the stages of coarse-to-fine projection of a coarse-level solution, we describe 



how to approximate a solution of Problem 2.1 for a single node only. This will be a basic step 



in the initialization and relaxation described in Sections 13.2.21 and 13.2.31 

Denote by iVj the set of zth neighbors with already assigned coordinates Xj. To minimize 
the local contribution of i to the total energy ([2]) , we assign to it a coordinate Xi that minimizes 

Wijlg\xi - xj\ . (6) 

jeNi 

Since for every j G iVj, Xi = Xj implies that the sum ^ is minus infinity, we resolve this 
ambiguity by setting 

Xi = x t t = arg min } w kj lg \x k - xA . (7) 

keNi i — * 

k^jeNi 
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The trivial exact solution has a running time 0(\Ni\ 2 ), as it requires to compute |iVj| sums, 
each one with \N%\ terms. Thus, to preserve the linear complexity of the entire algorithm, one 
can use the trivial solution for nodes with small |iVj| only. We will approximate ([7]) using the 
heuristic that seeks the nearly minimum sum in the point of maximal density. 

Consider set {xj : j G iVj} as independent and identically distributed samples of a random 
variable with unknown distribution and u>ij as the posteriori probability of a sample Xj. As- 
suming that, we have to choose a point where the estimated probability density is maximized. 
Various approaches exist for density estimation |25j. One of the most popular is called the 
"Parzen window" (or kernel density estimation) method. In this method, the density at point 
x is estimated as 

^) = ^E«(^), (8) 

where K is a kernel and h is a smoothing parameter called the bandwidth. To simplify calcula- 
tions, we choose a kernel similar to the Gaussian kernel. Thus, the estimated density at point 
x is 



\Ni\h 

Note that the factor i N ,i h may be omitted without changing the maximal's location. Moreover, 

it is sufficient to calculate d(xj) for every j G N%. This approach allows us to compute the 
density in all |JVj| points in linear time using following algorithm. 



Input: xi, . . . ,Xk - samples in increasing order 
Input: h - smoothing parameter (window) 
Input: pi,...,pk - posteriory probabilities 
Output: di, . . . , dk - estimated density 

/* calculate incrementally influence of left neighbors */ 

si <- pi 

for t = 2 to k do 

I s t = pt + s t -i2^ t - i - xt y h 

end 

/* calculate incrementally influence of right neighbors */ 

rk ^Pk 

for t = k — 1 to 1 do 

| n =p t + s t+ i2^ Xt ~ Xt +^/ h 
end 

/* aggregate results */ 
for t = 1 to k do 

I dt = st + n-pt 

end 



Algorithm 1: Linear algorithm for density estimation 



Likewise, for all density estimations techniques the crucial step is to choose the bandwidth 
h. On the one hand, it should be big enough to smooth the peaks and highly-oscillatory 
components. On the other hand, if h is too big, then the maximum density will always be located 
at the middle. Based on experiments, we choose h = N/2lg(N), where N = max(xi) — min(xj). 

To justify the chosen heuristic, we present our results from experiments on real data. In 
Figure [2] we show the error distribution for our approximation. The error is calculated as 

error = 6 — 6* , 
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Figure 2: Error distribution for density approximation on real- life networks. Each curve corre- 
sponds to one network. Each point on a curve corresponds to one difference between the exact 
(quadratic) solution of [7] and its approximation. The comparisons (points) are ordered by their 
errors in an ascending order. 

where 9* is the energy (excluding — oo terms) at the exact solution of ^ and 8 is an approxi- 
mation. We can easily see that most of the time the approximation error is either zero or very 
small. 

3.2.2 Initialization 

Given is the arrangement of the coarse-level aggregates in its generalized form, where the center 
of mass of each aggregate j £ C is positioned at Xj(j\ along the real axis. We initialize the fine- 
level arrangement by letting each seed j S C inherit the position of its respective aggregate: 
xj <— Xj(jy However, in contrast to the initialization by stages (that was done for the minimum 
p-sum and the minimum workbound problems [23]) in which the F- nodes' positions are calcu- 
lated according to all already initialized fine-level neighbors, we found that the initialization 
that is inspired by the principles of the classical AMG coarse solution projection x* <— P T x c 
can produce significantly better approximations. In other words, after initializing the fine-level 
C-nodes, F- nodes will be placed one by one using Algorithm [TJ which takes into account only 
the C-neighbors of a current F-point. The entire initialization scheme is presented in Algorithm 



3.2.3 Relaxations and strict minimization 

The initialization produces a feasible (that satisfies the constraint of ([2])) solution inherited from 
the coarse level. This solution is further improved by employing a small number of relaxation 
sweeps of two types: compatible and Gauss-Seidel (GS). The goal of the compatible relaxation 
is to improve the positions of -F-nodes while keeping the C-nodes invariant. One sweep of the 
compatible relaxation can be described by the two last "for" cycles of Algorithm [2] when in the 
first cycle the best coordinate is calculated over all neighbors. 

Having the improved ordering of F-nodes relative to the C-nodes, we apply GS relaxation. 



2. 
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Input: arrangement of a coarse graph 




Output: initial ordering of fine level nodes 




/* Initialize C-points 


*/ 


for all j € C do 




1 x i <~ x i(j) 




end 




/* Initialize F-points 


*/ 


for all j G F do 




Xi <— find best coordinate for j using Aleoriothm fTl with N» 




end 




/* Legalization of the coordinates 


*/ 


for all j € V do 




| ^ = Ui/2 + E Xk<Xi v k 




end 





Algorithm 2: Initialization of the fine level 



This relaxation traverses all nodes one by one and tries to find a best position for every node 
such that its own contribution to the total sum of logarithms will be minimized. In all our 
experiments we observed that, after improving the order with the compatible relaxation, the 
GS relaxation has almost no influence on the order. This is in contrast to other multiscale 
linear ordering methods in which the GS relaxation was one of the most powerful and crucial 
components. 

3.2.4 Strict minimization 

We applied two methods of strict minimization (SM, also known as "local refinement"): node- 
by-node minimization (N-N) and window minimization (WM). The principal difference between 
relaxations and SM is that in SM each change is accepted if and only if it reduces the entire 
sum Q and not the local node contribution only ([6]). 

In the N-N minimization all nodes are scanned according to their current order, and each 
vertex i, in its turn, is checked for the best position over some small enough segment that i 
belongs to. The k left and k right candidate positions are scanned, and the one with the minimal 
cost is chosen. 

In the WM (see [23] ) , the total cost of the arrangement is reduced by numerical minimization 
of a collective contribution of a small group of consecutive nodes. Given a current feasible 
solution x of the graph logarithmic arrangement, denote by Si a small correction to X{. Denote 
by 2U = {i\ = 7r _1 (s + 1), ...,i q = tt~ 1 (s + q)} a subset of successive q vertices in the current 
arrangement. The goal of the local minimizaiton problem is then to find 5 such that 



^ Wij lg \xi + S{ — £j - Sj\ + Wij lg \£i + Si- Xj\ (9) 

Li&SS ie2B 
is minimized. The solution to this optimization problem can be approximated numerically by 



using techniques similar to 3.2.1 



The situation with strict minimization is similar to that with GS. In all our experiments, 
we found no need to use the more complicated WM. The simple N-N minimization with small 
distance parameter k < 5 was enough to obtain the best results. This is a major advantage 
because in most multiscale schemes the significant part of the running time is spent on the 
uncoarsening. The lack of need (or, more correctly, optionality) for GS and WM (or other 
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collective optimization) also strongly advocates the choice of the C-nodes because the further 
compatible relaxation solves the problem well. 

After having defined all the components of the V-cycle, we present the full scheme in Algo- 
rithm [HI 



Input: Lf is the Laplacian of undirected graph Q (initially obtained from un(G)) 
Output: approximated graph minimum logarithmic arrangement 
if Q is small enough then 
solve the problem exactly 
return the arrangement 
end 

/* Coarsening */ 

calculate algebraic distances pij, G £ 

identify the set of C-points (see Q) 

construct the fine-to-coarse projection P (see Q) 

L c ^PL f P T 

/* Recursive call */ 

II <- V-cycle(L c ) 

/* Uncoarsening */ 

7i"i initialization(n) 

7T2 <— compatible-relaxation(7T"i) 

7T3 <— GS-relaxation(7T2) 

7T4 <— N-N refinement (^3) 

return tt^ 



Algorithm 3: ms-GMLogA: Full scheme of one V-cycle for GMLogA. 



4 Computational results 

The implementation of Algorithm [3] is based on the multiscale framework used in |23| . The 
implementation is nonparallel and has not been fully optimized. The results (arrangements and 
running times) should be considered only qualitatively and can certainly be further improved 
by more advanced implementation and multiscale techniques. 

Because of the practical significance of MLogA, we designed an open site [21] with the 
benchmark graphs and a set of numerical results. In this site we present the results listed below 
and invite the scientists who work on this problem to submit interesting new networks, their 
solutions, and improved arrangements for the existing networks. 

Our benchmark consists of 100 graphs of different nature and size (most of them are taken 
from [13] and [E]). For these graphs we created their undirected versions (by applying un(-)) 
and evaluated our algorithms on both the directed and undirected versions. We evaluated two 
baseline algorithms: 



MinLA+N-N: the multiscale solver that finds an approximation of MinLA [22J followed 



by GMLogA-oriented N-N fast postprocessing (see Section [3.2.4 with k = 10); 

• ms-GMLogA: GMLogA solver described in Algorithm [3| 

Clearly, MinLA and GMLogA can have different optimal orderings. However, we observed that 
qualitative MinLA orderings reinforced by appropriate postprocessing can also lead to good 
solutions of GMLogA. In spite of the fact that most of the best solutions were obtained with 
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ms-GMLogA and because the observation was done on a benchmark of large-scale real- life graphs 
(and there exists intensive research on MinLA |14l 116] ). we decided to present these results too. 
We also mention that the Fiedler vector-based solutions (even those that try to achieve a good 
local minimum for MinLA) usually produce significantly fewer qualitative solutions even with 
appropriate postprocessing. 

In our experiments we use a WebGraph framework [5J, which provides a simple way to 
manage large graphs. In particular, the last version of WebGraph produces random, lexico- 
graphical [5], Gray [Bj, double shingle [11] , and LayeredLPA [lj orderings for the graphs. The 
important feature of these orderings is that they are completely endogenous (i.e., determined 
by the graph itself), contrary to natural ordering, that is, the ordering in which the graph was 
created. It would seem that the random ordering always has to be significantly worser, than 
the lexicographic and natural orderings. However, this is not always true, mostly because of 
the ways the latter have been produced. For example, in highly parallel systems with shared 
memory, the parallelized graph traversal algorithms can produce arrangements that have been 
dumped in parallel from different processors. Each processor produces an arrangement of good 
locality for a small subset of nodes, but the entire arrangement does not possess this property. 
In particular, in our benchmark we found two graphs whose random ordering was better than 
the natural and lexicographic ones. 

We use the average number of bits per link p ^ clS cl unit of comparison for the main results 
presented in Figures [3^a-f) and[§a-b). In both fi gures the left and right columns correspond 
to the comparison results of directed and undirected graphs, respectively. In Figures [3^ a-f), 
each plot area consists of two curves. Each point in bold curves represents a ratio between 
c(G) obtained by ms-GMLogA and min(c(G ! , n na t),c(G, TTi ex ), c(G, Tr r nd)) for a particular graph 
from the benchmark, where ir na t, ^lex: an d 7T rn d are the best orderings obtained by natural, lex- 
icographic, and randomized arrangements, respectively. Similarly, each point in regular curves 
corresponds to the ratio with MinLA+N-N in the nominator. To demonstrate the robustness of 
the algorithm, we show several sets of its parameters. Each row pair of subfigures corresponds 
to the same set of parameters for directed and undirected graphs, respectively. 

In the multiscale algorithms, there are several possible sources (see [9]) of potentially higher 
than linear complexity (or linear with coefficients that are too big). Here we address the four 
most important: (a) the number of iterations in the relaxations/refinements, (b) the complexity 
of one iteration of relaxation/refinement, (c) the order of interpolation that can increase the 
complexity of the coarse graphs, and (d) too many C-nodes (addressed in Appendix A). The 
first set of parameters (Figures |3^a,b)) (which is a suggested default set) contains a mild config- 
uration. The order of interpolation (number of nonzero entries in one row of P) is only 1; k in 
N-N refinement is 5; and the number of relaxations of any kind is at most 20. The calculation 
of the algebraic distance couplings in this set is based on 5 randomly initialized vectors and 
only 20 iterations of JOR. For almost all graphs, we observed that ms-MLogA produces better 
orderings than does MinLA+N-N. On average, ms-MLogA improves the graphs by more than 
40% of their ordering cost in comparison to natural, lexicographic, and random arrangements. 

We note that the most beneficial graphs (those that have ratios from 0.2 to 0.4) in both 
directed and undirected versions come from different collections such as VLSI design networks, 
part of Web network, road maps, and Amazon links. In general, these graphs differ significantly 
in their structural properties. We observed that no particular part of the benchmark was less 
beneficial than another, thus attesting to the generality of the proposed method. 

The difference between improvements obtained with different parameter sets is not signif- 
icant; however, we provide them to demonstrate the robustness of the method. In the second 
set of parameters (Figure [3^c,d)) the number of random initial vectors in the algebraic distance 
coupling is reduced to 1 only. It leads to the faster coarsening (while keeping all convergence 
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and model properties of the algebraic distance [10]) and potentially weaker decisions regarding 
edge strengths based on the algebraic distance. In the third set of parameters (Figure [3][|e,f)) 
we increase k in N-N refinement from 5 to 10 and the interpolation order from 1 to 2 at the 
finest level of hierarchy. At all coarse levels these parameters are increased too, according to 
the logarithmic increase scale described in |22j . 

In Figure |4]^a,b) we show the difference between the very fast version of ms-GMLogA with 
no refinement and fast relaxation (5 sweeps) and the slowest version with very aggressive re- 
laxation/refinement parameters (k = 25 and 40 sweeps). Each subfigure contains two curves: 
regular and bold. The regular corresponds to the fast version and the bold to the slowest one. 
It is easy to see that the difference between these two versions is not significant and one can 
certainly use a fastest version to obtain qualitative results. 

The Gray [6] and double shingle |TT] orderings have proven themselves as more successful 
heuristics than lexicographic and natural orderings [23] and, thus, we present their comparison 
with ms-GMLogA in separate Figure [5^ a-b). Two curves depicted in Figure [5^ a) correspond 
to the comparison of Gray ordering on directed (regular curve) and undirected (bold curve) 
graphs, respectively. Each point on the curves corresponds to the ratio between f3 values of ms- 
GMLogA with default parameters and Gray ordering, respectively. Similarly, the comparison of 
double shingle ordering |11] is shown in Figure [5^b). The (double) shingle heuristic is based on 
the node similarity ordering derived from estimation of Jaccard coefficients. In [11] , the authors 
prove that (double) shingle heuristic will be beneficial for networks that can be described by 
the preferential attachment model. 

Recently, the layered label propagation algorithm (LayeredLPA) has been introduced in 
[1]. This propagation method is based on the Potts model |19j . This algorithm is significantly 
more successful than natural, random, lexicographic, Gray and (double) shingle orderings. The 
success of the label propagation methods has a similar nature to the success of the algebraic 
distance coupling (see Section 3.1.1 and [10]) in which the propagation and averaging of random 
values over the node neighborhoods is employed. However, our multiscale method shows better 
results (see Figure [6]) . Note that the LayeredLPA is introduced for undirected graphs only. 
We believe, that introducing the AMG-based framework to the label propagation model can 
significantly improve its quality. 



4.1 Scalability of ms-GMLogA 

Good scalability is one of the most important advantages of the multiscale algorithms. All 
components of our scheme are of linear complexity, and the total complexity is 0(|U| + \E\). 
The dependence of the running time on the graph size is depicted in Figure [7] Each small 
circle in the figure corresponds to a particular graph. We added to this figure a regression line 
whose slope is close to 1. The dependence is presented in logarithmic scale. We do not expect 
a more precise relationship between \V\ + \E\ and the running time since the implementation is 
far from being optimized and since it may depend also on structural factors such as the degree 
distribution of the graph and its diameter. 



4.2 Spectral approach 

Arrangement of the graph vertices according to the eigenvector (called the Fiedler vector) corre- 
sponding to the second smallest eigenvalue of the graph Laplacian is a well-known and successful 
heuristic used for many ordering problems such as the minimum p-sum and the minimum en- 
velope reduction (see [H [IS]). Usually, for large graphs the computation of the Fiedler vector 
is too expensive, and some approximation is employed [21] • Most of these approximations can 
be viewed as a global averaging process that works until a particular convergence. As a result, 



11 




40 60 
ordered graphs 




40 60 80 

ordered graphs 



(a) directed 



(b) undirected 




40 60 80 

ordered graphs 




40 60 80 

ordered graphs 



(c) directed 



(d) undirected 




20 40 60 

ordered graphs 




40 60 
ordered graphs 



(e) directed 



(f) undirected 



Figure 3: Main comparison results. Notations 'lg' and 'la' correspond to the reults of ms- 
GMLogA and MinLA+N-N, respectively. 
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Figure 5: Comparison of Gray and double shingle orderings versus ms-GMLogA. 
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Figure 6: Comparison of LayeredLPA versus ms-GMLogA. 
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Figure 7: ms-GMLogA: running time vs graph size. 
Table 1: Comparison of spectral method and ms-GMLogA 



Network 


Spectral 


ms-GMLogA 


as-caida20071105 


12.4471 


6.32 


email-Enron 


10.3117 


7.06 


oregonl_010407 


11.0662 


6.18 


p2p-Gnutella06 


10.6513 


7.13 


wiki-Vote 


9.2152 


7.41 



during this process the vertex tends to be located at the weighted average of its neighbors, 
which makes it suitable for solving the quadratic functionals such as the minimum 2-sum prob- 
lem [15] but creates a poor solution for the sum of logarithms. Several examples (from |17j ) of 
a comparison between spectral approach and ms-GMLogA are presented in Table [TJ 

4.3 Compressing the ordered networks 

In [11], Chierichetti et al. conjectured that minimizing the unweighted version of MLogA will 
automaticaly improve the graph compression. The conjecture has been supported by experi- 
mental observations on few web graphs and social networks. However, for general networks, 
a better ordering does not always imply a better compression ratio produced by the kind of 
compressions described in [5] . To illustrate that we present in Figure [8] a dependence of order 
improvement on the compression produced in [5]. 

Given an initial ordering (natural or Gray), we apply our algorithm to obtain a new ordering 
with smaller (3. The x-axis corresponds to the ratio between f3 values of initial ordering and 
ms-GMLogA. The y-axis corresponds to the ratio obtained by compressing these orders. On 
most of the graphs, the conjecture is confirmed. However, on some graphs we can see the 
degradation of the compression results. It is observable, in particular, in Figure [8^b) , where the 
initial arrangement was obtained by Gray ordering that is significantly better than the natural 
ordering. 
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Figure 8: Correlation between ordering and compression improvements 



5 Conclusions and future work 

We have proposed a fast linear algorithm (ms-GMLogA), for compression-friendly graph reorder- 
ing. The algorithm belongs to the family of multiscale methods |9j. It uses a novel AMG-based 
coarsening scheme that is reinforced by a modification of algebraic distance couplings [20J. The 
empirical results on a large benchmark demonstrate its quality and scalability. The model takes 
into account the edge and node weights that can express different properties of a network model 
such as known link importance or its access frequency. Overall, we recommend this multiscale 
framework for practical, compression-friendly network orderings. 

As future work we identify the following research directions that have the potential to 
improve ms-GMLogA: 

• development of a learning algorithm to improve the bandwidth h in Q; 

• development of a more sophisticated collective refinement of nodes; 

• development of a multiscale method for MLinGapA (defined below); and 

• more sophisticated multiscale organization of directed graphs. 

A more sophisticated collective refinement of nodes can allow interruption of the coarsening 
earlier. Thus, if some more qualitative approximation can be achieved at the very coarse levels 
instead of solving exactly the coarsest and applying fast simple refinement at the next few finer 
level, it can potentially lead to better solutions at the fine levels. 

The next natural step is to design first a refinement and then a full multiscale approach for 
the minimum logarithmic gap arrangement problem (MLinGapA) defined in JTjLJ as an alternative 
model for compression. In this problem the goal is to find a permutation of nodes that minimizes 



where /^(i, out(i)) is a sum of logarithms of gaps between consecutive neighbors of i ordered 
by 7r. This problem is also NP-hard [11] ; however, we believe that having a suitable refinement 
and relaxation algorithms would make it possible to adopt the multiscale framework for this 
problem as well. 

The last research direction is related to finding a better multiscale scheme for directed 
graphs. The presented model constructs an undirected edge ij from a given set of directed 




(10) 
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edges between i and j by reflecting both directions as an edge weight. In fact, this way of 
representation is not far from the attempts to solve the problems on nonsymmetric matrix A 
by applying the known techniques on A + A' or A ■ A' . It is known that these methods suffer 
from several drawbacks, however; thus, we identify a finding of a more advanced multiscale 
representation for directed graphs as one of the major future research directions. 
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Appendix A: Parameters 

01,2- The threshold parameters ©i and O2 from Q are responsible for a control of complexity of 
the coarse level problem and the quality of coarsening. It is important to check their robustness 
when designing a multiscale framework. The presented computational experiments have been 
executed with ©1 = 62 = 1/2, however, other values .2 < ©1, ©2 < .8 exhibit a good behavior 
of numerical results as well. Of course, the larger values increase the running time as the coarse 
graphs become bigger. 

u). Detailed discussion about choosing parameter oj for Jacobi over-relaxation on general 
graphs is discussed in |1U| . In all our computational experiments this parameter was 0.5. 
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